Search CORE

7 research outputs found

Automatic Machine Learning by Pipeline Synthesis using Model-Based Reinforcement Learning and a Grammar

Author: Cho Kyunghyun
Drori Iddo
Freire Juliana
Krishnamurthy Yamuna
Lourenco Raoni
Rampin Remi
Silva Claudio
Publication venue
Publication date: 01/01/2019
Field of study

Automatic machine learning is an important problem in the forefront of machine learning. The strongest AutoML systems are based on neural networks, evolutionary algorithms, and Bayesian optimization. Recently AlphaD3M reached state-of-the-art results with an order of magnitude speedup using reinforcement learning with self-play. In this work we extend AlphaD3M by using a pipeline grammar and a pre-trained model which generalizes from many different datasets and similar tasks. Our results demonstrate improved performance compared with our earlier work and existing methods on AutoML benchmark datasets for classification and regression tasks. In the spirit of reproducible research we make our data, models, and code publicly available.Comment: ICML Workshop on Automated Machine Learnin

arXiv.org e-Print Archive

Open Repository and Bibliography - Luxembourg

BugDoc: Iterative debugging and explanation of pipeline

Author: DE PAULA LOURENCO Raoni
Freire Juliana
Shasha Dennis
Simon Eric
Weber Gabriel
Publication venue: Springer Science and Business Media Deutschland GmbH
Publication date: 01/01/2023
Field of study

peer reviewedApplications in domains ranging from large-scale simulations in astrophysics and biology to enterprise analytics rely on computational pipelines. A pipeline consists of modules and their associated parameters, data inputs, and outputs, which are orchestrated to produce a set of results. If some modules derive unexpected outputs, the pipeline can crash or lead to incorrect results. Debugging these pipelines is difficult since there are many potential sources of errors including: bugs in the code, input data, software updates, and improper parameter settings. We present BugDoc, a system that automatically infers the root causes and derive succinct explanations of failures for black-box pipelines. BugDoc does so by using provenance from previous runs of a given pipeline to derive hypotheses for the errors, and then iteratively runs new pipeline configurations to test these hypotheses. Besides identifying issues associated with computational modules in a pipeline, we also propose methods for: “opportunistic group testing” to identify portions of data inputs that might be responsible for failed executions (what we call), helping users narrow down the cause of failure; and “selective instrumentation” to determine nodes in pipelines that should be instrumented to improve efficiency and reduce the number of iterations to test. Through a case study of deployed workflows at a software company and an experimental evaluation using synthetic pipelines, we assess the effectiveness of BugDoc and show that it requires fewer iterations to derive root causes and/or achieves higher quality results than previous approaches

Open Repository and Bibliography - Luxembourg

DataPrism: Exposing Disconnect between Data and Systems

Author: DE PAULA LOURENCO Raoni
Fariha Anna
Freire Juliana
Galhotra Sainyam
Meliou Alexandra
Srivastava Divesh
Publication venue: Association for Computing Machinery
Publication date: 10/06/2022
Field of study

peer reviewedAs data is a central component of many modern systems, the cause of a system malfunction may reside in the data, and, specifically, particular properties of data. E.g., a health-monitoring system that is designed under the assumption that weight is reported in lbs will malfunction when encountering weight reported in kilograms. Like software debugging, which aims to find bugs in the source code or runtime conditions, our goal is to debug data to identify potential sources of disconnect between the assumptions about some data and systems that operate on that data. We propose DataPrism, a framework to identify data properties (profiles) that are the root causes of performance degradation or failure of a data-driven system. Such identification is necessary to repair data and resolve the disconnect between data and systems. Our technique is based on causal reasoning through interventions: when a system malfunctions for a dataset, DataPrism alters the data profiles and observes changes in the system's behavior due to the alteration. Unlike statistical observational analysis that reports mere correlations, DataPrism reports causally verified root causes-in terms of data profiles-of the system malfunction. We empirically evaluate DataPrism on seven real-world and several synthetic data-driven systems that fail on certain datasets due to a diverse set of reasons. In all cases, DataPrism identifies the root causes precisely while requiring orders of magnitude fewer interventions than prior techniques

Open Repository and Bibliography - Luxembourg

AlphaD3M: An Open-Source AutoML Library for Multiple ML Tasks

Author: Castelo Sonia
DE PAULA LOURENCO Raoni
Freire Juliana
Ono Jorge
Rampin Remi
Santos Aécio
Silva Claudio
Publication venue
Publication date: 12/09/2023
Field of study

peer reviewedWe present AlphaD3M, an open-source Python library that supports a wide range of machine learning tasks over different data types. We discuss the challenges involved in supporting multiple tasks and how AlphaD3M addresses them by combining deep reinforcement learning and meta-learning to construct pipelines over a large collection of primitives effectively. To better integrate the use of AutoML within the data science lifecycle, we have built an ecosystem of tools around AlphaD3M that support user-in-the-loop tasks, including selecting suitable pipelines and developing custom solutions for complex problems. We present use cases that demonstrate some of these features. We report the results of a detailed experimental evaluation showing that AlphaD3M is effective and derives highquality pipelines for a diverse set of problems with performance comparable or superior to state-of-the-art AutoML systems

Open Repository and Bibliography - Luxembourg

AlphaD3M: Machine Learning Pipeline Synthesis

Author: Cho Kyunghyun
DE PAULA LOURENCO Raoni
Drori Iddo
Freire Juliana
Krishnamurthy Yamuna
Piazentin Ono Jorge
Rampin Remi
Silva Claudio
Publication venue
Publication date: 01/01/2021
Field of study

peer reviewedWe introduce AlphaD3M, an automatic machine learning (AutoML) system based on meta reinforcement learning using sequence models with self play. AlphaD3M is based on edit operations performed over machine learning pipeline primitives providing explainability. We compare AlphaD3M with state-of-the-art AutoML systems: Autosklearn, Autostacker, and TPOT, on OpenML datasets. AlphaD3M achieves competitive performance while being an order of magnitude faster, reducing computation time from hours to minutes, and is explainable by design

arXiv.org e-Print Archive

Open Repository and Bibliography - Luxembourg

Escalated Antipredator Mechanisms Of Two Neotropical Marsupial Treefrogs

Author: Caio V.
Cassio Z.
Danilo S.
Edmund D.
Jr.
Lourenco-de-Moraes
Luis Felipe
Mirco
Raoni
Ricardo
Rodrigo B.
Tadeu
Publication venue: 'The London Student Journal of Medicine'
Publication date: 13/11/2017
Field of study

Conselho Nacional de Desenvolvimento Científico e Tecnológico (CNPq)Coordenação de Aperfeiçoamento de Pessoal de Nível Superior (CAPES)Coordenação de Aperfeiçoamento de Pessoal de Nível Superior (CAPES)Fundação de Amparo à Pesquisa do Estado de São Paulo (FAPESP)The sequence and intensity of antipredator mechanisms may be displayed according to the risk of predation. We tested this hypothesis using two species of marsupial treefrogs from Brazil's Atlantic Forest. We observed Gastrotheca recava and G. megacephala displaying nine antipredator mechanisms and three types of defensive calls. These behaviours were displayed in an escalated sequence from motionless (passive behaviour) to biting (the most aggressive behaviour). This diversified set of antipredator mechanisms may be related to the interaction between predator and prey at the local scale. The escalated sequence of defensive behaviours should be considered in future studies on anuran-predator interaction.263237244CNPq [140710/2013-2, 405285/2013-2, 302589/2013-9, 483412/2010-4]Ecology Center at Utah State UniversityCAPES/FAPESCAPESFAPESP [2014/233887]Rufford FoundationHerpetologist's LeagueConselho Nacional de Desenvolvimento Científico e Tecnológico (CNPq)Coordenação de Aperfeiçoamento de Pessoal de Nível Superior (CAPES)Coordenação de Aperfeiçoamento de Pessoal de Nível Superior (CAPES)Fundação de Amparo à Pesquisa do Estado de São Paulo (FAPESP

Repositorio da Producao Cientifica e Intelectual da Unicamp

Neotropical freshwater fisheries : A dataset of occurrence and abundance of freshwater fishes in the Neotropics

Author: Abelha Milza Celi Fedatto
Abilhoa Vinicius
Aguirre Maldonado Windsor Efren
Albuquerque Bianca Weiss
Althoff Sergio Luiz
Alvarez-Pliego Nicolas
Alves Jonatas
Alvez Helen Jamille Fernandes Silva
Andriola Joao Vitor Perin
Angulo Sibaja Arturo
Arrolho Solange
Assis Juliana Camara
Auer Sonya K.
Azevedo-Santos Valter M.
Bailly Dayani
Balbi Thiago Jose
Barba-Macias Everardo
Barbosa Thiago Augusto Pedroso
Barp Elisete Ana
Barros Maria Claudene
Barufatti Alexeia
Bassar Ronald D.
Bastiani Marlos
Batista Gabriel de Avila
Begot Tiago Octavio
Bellay Sybelle
Benedito Evanilde
Benone Naraiana Loureiro
Bernardes Peronico Phamela
Bezerra Luis Artur Valoes
Bialetzki Andrea
Bigorne Remy
Birindelli Jose Luis Olivan
Bogoni Juliano Andre
Bono Alessandra
Borges Pedro Paulino
Borja-Acosta Kevin Giancarlo
Bornatowski Hugo
Botero Jorge Ivan Sanchez
Braga Silva Alline
Braga Lorrana Thais Maximo Durville
Braga Marcelo Renno
Braga Raul Renno
Brambilla Eduardo Meneguzzi
Brito Marcelo Fulgencio Guedes
Brosse Sebastien
Buenano-Carriel Martha
Caetano Dyego Leonardo Ferraz
Calvache Uvidia Evelyn Vanessa
Campos Vinicius Farias
Canas-Merino Mauricio
Canas-Rojas Diego
Cantagallo Devids Camila
Carmassi Alberto Luciano
Carmassi Giulianna Rondineli
Carneiro Lais
Carneiro Ronaldo Leal
Carrillo-Moreno Carolina
Carrillo-Moreno Carolina
Carvalho Pedro Hollanda
Casimiro Armando Cesar Rodrigues
Cassemiro Biagioni Renata
Catelani Paula Araujo
Cetra Mauricio
Contreras Palma Kamila
Costa Ana Paula Lula
Covain Raphael
Cruz-Ramirez Allan K.
Cunha Eduardo Ribeiro
Cunha Priscila de Oliveira
Cunico Almir Manoel
D'Bastiani Elvira
da Costa Fraga Elmary
da Cunha Cristina Jaques
da Graca Weferson Junio
da Rosa Clarissa Alves
da Rosa Marlon Ferraz
da Silva Campos Thiago Nascimento
da Silva Freitas Tiago Magalhaes
da Silva Goncalves Cristina
da Silva Ingenito Leonardo Ferreira
da Silva Andre Teixeira
da Silva Diego
da Silva Felipe Pessoa
da Silva Jislaine Cristina
da Silva Joao Fernando Marques
da Silva Juliana Paulo
da Silva Lucas Goncalves
da Silva Monica Andrade
da Silva Valeria Flavia Batista
da Silveira Prudente Bruno
da Silveira Tony Leandro Rezende
Daga Vanessa Salete
Dala-Corte Renato Bolson
Dary Eurizangela Pereira
Datovo Alessio
Dattilo Wesley
de Almeida Nobre Rodrigo
de Almeida Marcelo Silva
de Almeida Vera Lucia Lescano
de Andrade Frehse Fabricio
de Aquino Pedro De Podesta Uchoa
de Araujo Filho Joao Antonio
de Araujo Passos Pacheco Bruno Gorini
de Araujo Atila Rodrigues
de Araujo Monica Pacheco
de Assis Montag Luciano Fogaca
de Assis Volpi Thais
de Carvalho Rocha Yuri Gomes Ponce
de Carvalho Daniel Cardoso
de Carvalho Debora Reis
de Carvalho Fernando Rogerio
de Carvalho Mateus Moreira
de Castro Barradas Amauri
de Faria Falcao Jessica Caroline
de Fatima Ramos Guimaraes Tais
de Freitas Danielly Torres Hashiguti
de Freitas Patricia Domingues
de Fries Lucas
de Lima Pereira Karla Dayane
de Lima Felipe Pontieri
de Lucena Carlos Alberto Santos
de Lucena Zilda Margarete Seixas
de Mello Cionek Vivian
de Mello Franco Teixeira
de Menezes Yazbeck Gabriel
de Moraes Pires Walna Micaelle
de Oliveira Barbosa Hugo
de Oliveira Leonardo Brito
de Paiva Affonso Igor
de Paula Santos Rosiane
de Pinna Mario Cesar Cardoso
de Souza Delapieve Maria Laura
de Souza Trigueiro Nicholas Silvestre
de Souza Rafael Couto Rosa
Delariva Rosilene Luciana
Di Dario Fabio
Dias Murilo Sversut
do Amaral Eduardo Cazuni
do Nascimento Bruno Tayar Marinho
do Nascimento Maria Histelle Sousa
do Prado Fernanda Dotti
do Prado Helena Alves
Donascimiento Carlos
Donascimiento Carlos A.
dos Reis Roberto Esser
dos Santos Maroclo Gomes Andrea Cristina
dos Santos Ribas Luiz Guilherme
dos Santos Sousa Kassiano
dos Santos Arthur Alexandre Capelli
dos Santos Juliana Silveira
dos Santos Luciano Neves
Duboc Luiz Fernando
El-Sabaawi Rana W.
Emidio Jr Carmino
Emidio Junior Carmino
Emiliano Thais Moura
Fagundes Patricia Calegari
Fagundes Valeria
Faria Larissa
Fernandes Carlos Alexandre
Ferreira Anderson
Ferreira Beatriz Moreira
Ferreira Fabiane Silva
Ferrer Juliano
Figueiredo-Filho Jesse
Flecker Alexander S.
Florido Rosa
Fonseca Fabiana Luques
Fontoura Nelson Ferreira
Francisco Talitha Mayumi
Franco Ana Clara Sampaio
Freitas Matheus Oliveira
Freitas Pamela Virgolino
Freitas-Souza Diogo
Frota Augusto
Galetti Mauro
Galetti Pedro Manoel
Galiano Daniel
Garcia Diego Azevedo Zoccal
Garcia Thiely Oliveira
Genova Joao Gabriel
Gerhard Pedro
Gertum Becker Fernando
Giovanelli Joao Gabriel Ribeiro
Giraldo Perez Alejandro
Godinho Alexandre Lima
Gomes Fischer Luciano
Gomes Louise Cristina
Gomes Tatyana
Gomez-Uchida Daniel
Goncalves Bruno Bastos
Gonzalez Jessica Antunez
Goyenola Guillermo
Guarderas-Flores Lida
Gubiani Eder Andre
Guimaraes Sales Naiara
Gurgel-Lourenco Ronaldo Cesar
Harrod Chris
Hartmann Caroline
Hartz Sandra Maria
Hawes Joseph E.
Herrera-Madrid Mauricio
Hilsdorf Alexandre Wagner Silva
Hirschmann Alice
Ilha Paulo
Isaac-Nahum Victoria J.
Jarduli Lucas Ribeiro
Jerep Fernando Camargo
Jezequel Celine
Jimenez-Prado Pedro
Jung Aline
Kashiwaqui Elaine Antoniassi Luiz
Kasper Carlos Benhur
Kassner Filho Anderson
Keppeler Friedrich Wolfgang
Kubiak Bruno Busnello
Kuetter Mateus Tavares
Kuetter Vinicius Tavares
Kurchevski Gregorio
Larentis Crislei
Leal Cecilia Gontijo
Lehmann Albornoz Pablo Cesar
Lima Fabio
Lima Luciano Benedito
Lima-Junior Dilermando Pereira
Lima-Junior Sidnei Eduardo
Liotta Jorge
Liotta Jorge
Lobato-de Magalhaes Tatiana
Lopes Ueslei
Loures Raquel Coelho
Lustosa Costa Silvia Yasmin
Machado Carolina
Machado Carolina
Machado Debora Ferreira
Maffei Fabio
Magalhaes Andre Lincoln Barroso
Maia Calebe
Malabarba Luiz Roberto
Manna Luisa Resende
Manoel Pedro Sartori
Mantovano Tatiane
Marinho Jorge Reppold
Martins Lidiane
Martins Renato Tavares
Martins Waldney Pereira
Matthiensen Alexandre
Mazzoni Rosana
Mena-Valenzuela Patricio
Meneses Bruna Arbo
Mincarone Michael Maia
Moraes Djalma Pereira
Moreno Martha Elena Valdez
Munoz-Mendoza Carla
Nakagawa Bruno Kazuo
Nascimento Cristiane A. S.
Negrete Ivan Vinicio Jacome
Nitschke Pedro Peixoto
Nobile Andre Batista
Nobre Andrezza Bellotto
Nobrega Marinho Furtado Shaka
Novaes Jose Luis Costa
Occhi Thiago Vinicius Trento
Okada Edson K.
Oliveira Silva Leonardo
Oliveira Brunno Tolentino
Oliveira Fagner Junior M.
Orlandi Bonato Karine
Orsi Mario Luis
Pascual Miguel
Peixoto Paula
Pera Carolina
Pereira Hasley Rodrigo
Peres Carlos A.
Peressin Alexandre
Petrucio Mauricio Mello
Petry Ana Cristina
Petsch Danielle Katharine
Piana Pitagoras Augusto
Piedade Maria Teresa Fernandez
Pincheira-Ulbrich Jimmy
Pinheiro Ronaldo Fernando Martins
Pinho Henrique Ledo Lopes
Piscor Diovani
Pisicchio Cristina Moreira
Plesley Priscila
Pompeu Paulo Santos
Porto-Foresti Fabio
Pott Crisla Maciel
Prado Ivo Gaviao
Pringle Catherine M.
Prodocimo Viviane
Queiroz Igor Raposo
Quezada-Romegialli Claudio
Quirino Barbara Angelio
Ramirez Jorge Luis
Ramos Telton Pedro Anselmo
Re Reginaldo
Rego Ana Carolina Lacerda
Resende Leonardo Cardoso
Rezende Carla Ferreira
Ribeiro Milton Cezar
Ribeiro Vanessa
Ribolli Josiane
Rivadeneira Juan Francisco
Rizzi Francisco Provenzano
Rocha Elise Amador
Rodrigues Filho Carlos Alberto De Sousa
Rodrigues Leydiane Nunes
Rodrigues Raoni Rosa
Rosa Rafael Rogerio
Rosa Ricardo
Ruaro Renata
Sa-Oliveira Julio Cesar
Salas Johnson Daniel
Salcedo Miguel Angel
Saldanha Barbosa Amanda
Salvador Gilberto Nepomuceno
Sanches Alexandra
Sanchez Alberto J.
Santana Daniel Oliveira
Santos Pablo Henrique Fernandes
Sarmento Soares Luisa Maria
Sartor Natane
Sartorello Ricardo
Schmitter-Soto Juan Jacobo
Schulz Uwe Horst
Severo-Neto Francisco
Shibatta Oscar Akio
Silva Thiago Teixeira
Silva-Santos Rosane
Silvano Renato Azevedo Matias
Sily Maria Cecilia
Smith Welber Senteio
Soares Philip Teles
Solorzano Julio Cesar Jut
Soteroruda Brito Gita Juan
Sousa Hingara Leao
Stefani Marta Severino
Suarez Yzel Rondon
Tagliaferro Marina
Tedesco Pablo A.
Teixeira Adonias Aphoena Martins
Teixeira Francisco Keilo
Teixeira Jessica Vieira
Teresa Fabricio Barreto
Tesitore Giancarlo
Tiburcio Vanessa Graciele
Tobes Ibon
Tonella Livia Helena
Tonini Lorena
Topan Dhyego Hamilton
Torres Parahyba Campos Bruno Augusto
Tufino Paul
Ubaid Flavio Kulaif
Vaini Jussara Oliveira
Valdiviezo-Rivera Jonathan
Viana Douglas
Vicentin Wagner
Vidotto-Magnoni Ana Paula
Vieira Fernando Emmanuel Goncalves
Vila Irma
Villamarin Francisco
Vitorino Junior Oscar Barroso
Vitule Jean Ricardo Simoes
Wojciechowski Juliana
Wojciechowski Juliana
Wolff Luciano Lazzarini
Yanez-Munoz Mario H.
Zandona Eugenia
Publication venue: 'Wiley'
Publication date: 01/01/2023
Field of study

The Neotropical region hosts 4225 freshwater fish species, ranking first among the world's most diverse regions for freshwater fishes. Our NEOTROPICAL FRESHWATER FISHES data set is the first to produce a large-scale Neotropical freshwater fish inventory, covering the entire Neotropical region from Mexico and the Caribbean in the north to the southern limits in Argentina, Paraguay, Chile, and Uruguay. We compiled 185,787 distribution records, with unique georeferenced coordinates, for the 4225 species, represented by occurrence and abundance data. The number of species for the most numerous orders are as follows: Characiformes (1289), Siluriformes (1384), Cichliformes (354), Cyprinodontiformes (245), and Gymnotiformes (135). The most recorded species was the characid Astyanax fasciatus (4696 records). We registered 116,802 distribution records for native species, compared to 1802 distribution records for nonnative species. The main aim of the NEOTROPICAL FRESHWATER FISHES data set was to make these occurrence and abundance data accessible for international researchers to develop ecological and macroecological studies, from local to regional scales, with focal fish species, families, or orders. We anticipate that the NEOTROPICAL FRESHWATER FISHES data set will be valuable for studies on a wide range of ecological processes, such as trophic cascades, fishery pressure, the effects of habitat loss and fragmentation, and the impacts of species invasion and climate change. There are no copyright restrictions on the data, and please cite this data paper when using the data in publications

University of Salford Institutional Repository

University of East Anglia digital repository